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A  NOTE  ON  ASYMPTOTIC  JOINT  DISTRIBUTION  OF  THE  EIGENVALUES 
OF  A  NONCENTRAL  MULTIVARIATE  F  MATRIX 

by 

Z.  D.  Bai 


Center  for  Multivariate  Analysis 
University  of  Pittsburgh 


,  -  .  ABSTRACT 

7)0-,,  ei'It'-tjL,  LojibZ  K  //J(/ 

“  §  '  1  rfh 


J  ^ 

In  m-fc-r  Hsu-jJ^endw-^ath.  Soc.  19414.  the  proof  of  the  basic 
Lemma  3  is  based  on  Lemma  1  which  Is  wrong.  The  aim  of  this  <note  is  to 


correct  the  prQof  of  Lemma  3  and  consequently  to  ensure  the  main  theora* 

Hs  wV  '  j  Lo*  ,, 

In-tMrr-tistrfy.  tendoft-Mathr  -Soc7~~1 941 ) .  .  rr/s-  r/cez-CM 


J 
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c- 


Kev  Words;  Limiting  distribution,  multivariate  F  matrix,  eigenvalues  of 
random  matrix 
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2. 


1.  INTRODUCTION 


Consider  k  p-variate  normal  populations  with  mean  vectors  (£1jf  £2j .  *(*> 

»  •  1.  2 . k  and  a  common  covariance  matrix  £  •  ||  0|j  ||.  Now.  let  £•  •  (£u.  ^2i-^pi)  an<^ 

—  -i 

^11'  ^21*“"^p1'  ^  ! 

C  *  Z\2‘  522,,,,,^p2'  * 


i 

j^lk'  ^2k'*"'^pk'  fv 

The  geometrical  meaning  of  the  rank  of  £  is  obvious,  if  its  rank  is  one.  the  k  centroids  gP 
the  k  populations  are  coincident,  if  it  is  two.  the  centroids  are  coilinear  but  not  coincident 
and  so  on.  So.  the  rank  of  £  is  important  in  certain  problems  of  inference  in  the  area  o£ 
multivariate  analysis. 


Suppose  there  are  k  samples,  with  sUe  m1#  m2....,m|t,  respectively,  drawn  from  the  K 


populations.  Let 


*1  •  J  l.  mt*i« 

n  t-1 

*i|  ■  fj  j,  "V«it  -  tiH5„  -  5,)' 


N  -  £  m, 
t-1 


it  is  not  difficult  to  see  that  the  rank  of  the  matrix  ||  tyj  ||  is  one  less  than  that  of  £ 
On  the  other  hand,  the  rank  of  ||  fyj  (J  is  in  turn  equal  to  the  number  of  positive  roots  op 
the  determinants!  equation 


|  V|j  -  Xa,j  |-0  3) 

because  the  matrix  ||  fyj  |j  is  nonnegative  definite  and  £  -  ||  Ojj  ||  is  positive  definft% 


Thus,  the  problem  to  investigate  the  rank  of  5  turns  out  to  be  the  problem  to  investigate 
the  number  of  positive  roots  of  (3).  A  review  of  the  literature  on  testing  for  the  roe*  of 
is  given  in  Krishnaiah  (1982). 

Now.  let  us  consider  the  samples  drawn  from  the  k  multivariate  normal  populations. 

Let  (x1t . xpt)  denote  the  mean  vector  of  the  t-th  sample  and  (x1,...^p)  denote  the  mean 

vector  of  the  grand  sample,  9jjt(i.  i  •  1.  2 . p)  the  second  moments  about  the  means  of 

the  t-th  sample.  Write 

k 

i,j-  I  mt(x|t  -  x,)(xjt  -  xj) 
t-1 

k 

bjj  ■  l  nH®,jt 
t-1 

.  •  min  (p,  k  -  1) 

l2  •  max  (p.  k  -  1). 

It  is  obvious  that  the  matrix  ||  ei|j  |)  is  nonnegative  definite  and  has  rank  a«d  that 
the  matrix  ||  ||  is  positive  definite  provided  that  N  -  k  >  p.  Hence  the  deterfflfiamal 

equation  in  $ 

I  «||  -  I  ■  0  (4) 

has  a  root  zero  of  multiplicity  p  -  and  positive  roots. 

The  nonzero  roots  of  (4)  play  an  important  role  in  discriminant  analysis*  Their 
distribution  depends  solely  upon  the  roots  of  (3),  and  their  exact  distribution  is  frown  in 
the  case  that  where  all  roots  of  (3)  are  zero  (see  Fisher  (1939).  Hsu  (1939).  and  Hoy 
(1939)).  In  the  general  case.  Hsu  (1941)  obtained  the  limiting  distribution  of  these  positive 
roots  and  it  is  given  in  the  following  theorem. 

Theorem  (P.  i.  Hsu).  Suppose  the  sample  sizes  satisfy  the  condition  mt  -  mqt,  t  -  I  2  -  k. 


Let  q  »  f  and  N  •  mq  (not*  that  the  roots  of  (3)  is  independent  of  m  in  ths  present 

t*1 

case).  Suppose  that  (3)  has  positive  roots  Xj  >  X2  >~>XV  >  0  with  multiplicities 
UV  U2 . Uy,  respectively.  Write 


Sq  •  0,  ah  *  ah-1  ♦  Ufy  h  •  1,  2 . v,  r  ■  i^. 


Also  writ*  the  positive  roots  of  (4)  as  >.  42  2.-^4 .  >  D*fin* 


Cj  -  /fi(2Xg  ♦  4Xj|)  2(4|  -  Xh).  (i  -  ah.1  ♦  h  •  1.  2.....V) 


Ci  •  **4* 


(I  •  r  ♦  1 — t1) 


Then  the  limiting  distribution  density  (as  m  ♦  •)  of  is 


where 


D(Xj  • » •  Xy) 


J,  3-L(Xi  ■  v}-pH  j,1' 


<"  *  *!  -1  *2  -  •••  -  V 


>i<vi . 't,>  -  (i)T<'>’t><°l'r)’7<,i’r>{l;1  «K  -  r  - 1 +  * 


<11 

'l-r+l  J-i+1  1  J  1 1 1-r+l  !• 


+  ‘i  «,  ] 

*  i-r+1  1  J 


(-  >  >  0,  nx  -  k-  1). 

Unfortunately,  Lemma  1  In  the  paper  of  Hsu  (1941)  Is  not  correct.  The 
main  result  of  Hsu  Is  based  upon  thm  above  lemma.  Recently,  Prof.  tf.  Q.  Lien 
found  this  mistake  and  gave  a  counterexample.  The  purpose  of  this  note  is, 
according  to  the  suggestion  of  Professor  K.  L.  Chung,  to  give  a  new  proof 
of  Lemma  3  in  Hsu’s  paper;  this  lemma  plays  a  key  role  in  the  proof  of  the 


main  theorem  of  the  result  of  Hsu. 
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2.  A  COUNTEREXAMPLE 

To  clearly  understand  the  counterexample,  we  have  to  restate  here  the. 
Lemma  1  In  Hsu  (1941) . 

Let  Q  (n  -  1,  2,...)  be  a  random  point  with  a  finite  number.  Independent 
n 

of  n,  of  coordinates,  and  let  its  domain  of  existence  be  the  Borel  set  E  . 

n 

Let 

and  put 

11m  E  -  E. 
n 


Let  the  probability  of  Qq  approach  a  limiting  probability  function,  which 
is  continuous,  as  n 

Let  f  (P)(n  -  1,  2,...)  be  a  real  point  function  defined  and  Borel 
n 

* 

measurable  in  E  .  Let 
n 


llm  f  (P)  -  0  throughout  E 
n^»  n 

then  the  random  variable  f  (Q  )  +  0  in  pr.  as  n  +  ». 

n  xn  r 

Example  1.  Let  pR  denote  the  n*^1  prime  number,  and  let 
Define 


f„CP) 


otherwise. 


r 


p 

p 


Evidently,  we  have 

fn(P)  ♦O.Vpc  (0,1). 

Define  random  variable  Q  with  its  distribution  function 


k  (x)  p  +  1  fx  .  ... 

„  <  *>  ■  -TT  * -fc—  LI<°-‘>'u)du 


p„(x)  •  HQ 


where  k  (x)  is  the  number  of  elements  of  F  ,  which  are  less  than  x, 
n  n 

denotes  the  indicator  function  of  the  set  A.  It  la  obvious  that  F 

n 

R(0,1) ,  the  uniform  distribution  over  the  interval  (0,1),  as  n  •. 

know  that  E  •  E  •  (0,1),  n  -  1,  2,...  .  But  we  evidently  have 
n 


and  I , 

A 

tends  to 
Also, 


6 


1 


6 


P(f  (Q_) 

n  n 


1)  -  P(QQe 


Fn) 


Pn  : 

2P. 


1 

2* 


In  fact,  we  can  construct  an  example  in  which  all  the  f^(P) ,  n  »  1.  2.-. . 
are  continuous  and  all  the  distributions  of  n  ■  I,  2,..»,  are  absolute]^/ 
continuous,  but  the  conclusion  of  this  Lemma  is  not  true.  On  the  other  htni, 
we  can  show  that  if  each  Q  has  a  probability  density  q  and  q  tends  to  C 

n  u  u 

limit  density  q,  then  the  conclusion  of  this  lemma  will  be  true. 

But,  we  omit  the  details  since  the  main  purpose  of  our  paper  is  to  five 
a  correct  proof  of  Hsu's  theorem  and  not  to  give  details  of  counterexamples 
of  his  lemma. 


% 


V.  V-  \.\ -  -  v  a  v\vW\M 


3.  PROOF  OF  HSU'S  THEOREM 


Write 


«  k  -  1,  n  ■  N  -  k,  v  *  N 


1 

2 


lyi*?**  “n  ’  -=  <  1  *iv  * N) 


V-l 


In  [2],  it  is  correctly  proved  that  the  distribution  of  the  positive 
roots  of  (4)  is  the  same  as  that  of  the  following  determinants!  equation 


where 


|v  A  +  vC  -  v$U  +  D|  ■  0 


'V'  • u 


^11 . “lp 


u*  .....v'fu 

pi  pp 


<X1  -  ♦>*. 


‘•(Xv-^V 


-  ♦! 


P-r 


cn  •••  C1  Ei 


'lv 


•  •  •  c 


vv  v 


E1  •••  Ev 


°hh  yij  +  yjill  •  *•  1  •  *h-t  +  1 . V  h  *  *’ 2i 

chg  •  nv5t, yji  +/tg  yy11 .  i>  *  s.  i  -  vi + 1 . v 

j  *  Vi + 1 . *g’ 

*h  ■  njll  •  1  *  r  +  1 . .  i'Vi*  1 . V 


..  ,v. 


and  {y  ,  z  }  has  joint  probability  density 
i  j  j  ^ 


1. 


-  -rp(n  +n) 

(2ir)  1  exp 


(-yd  l'y  2t}+  l  ?.«?. 


i-i  j«i 


i*l  v-i 


iv 


)> 


10) 


In  Lemma  2  of  [2],  it  is  shown  that  all  the  u^'s  tend  to  i.i.d.  N(D  l) 's. 


Though  Lemma  1,  which  is  not  true,  was  used  in  the  proof  of  Lemma  2,  the 
correctness  of  Lemma  2  is  easily  seen.  Its  proof  can  he  easily  modified 
and  is  omitted.  The  reader  can  refer  to  Lemmas  9  and  10  in  X.  R.  Chen  (|48f)> 


According  to  this  lemna,  the  u^'s  *-n  (9)  was  replaced  by  a  set  of  a»>,d. 


N(0,1)  variables  {w^}  (though  this  was  not  obviously  stated).  But  the  correct¬ 


ness  of  this  approach  would  not  be  easily  seen.  For  this,  we  need  the  following 
lemma. 


Lemma  1.  Let  d  be  a  positive  integer  and  Q^,  Q  be  probability  measured 

defined  on  (R**,  8(R^))  such  that  Q  Q.  Then  there  is  a  probability  spdce 

»  n  • 

(0,  F,  P)  on  which  we  can  define  a  sequence  of  random  vectors  (X^)  and  X  *>*ch 

that  X  (w)  -►  X(oj),  ¥  u  e  0,  and  that  X  and  X  have  distributions  F  and  F, 
n  n  n  • 

respectively. 

In  fact,  this  lemma  can  be  extended  to  the  weakly  convergent  sequence  of 
probability  measures  defined  on  a  complete  and  separable  metric  space. 

Because  the  proof  is  rather  long  and  there  are  some  interesting  applicai i ofls 
of  this  lemma,  we  will  discuss  it  in  detail  in  a  separate  paper  (See  3*1  Ogd 
Liang  (1984)  [4]). 

Applying  this  lemma,  we  can  define  {u£^  ,  w^;  i,  j  -  1,  2,...,p, 

N  -  1,  2,...}  on  some  probability  space  such  that  {u.^ }  P°*ntw*s.e.»  {w  4 
N  -*■  •,  (u^}andi{u^}  are  identically  distributed  and  that  {w^}  is  a  set* 
of  i.i.d.  N(0,1)  random  variables.  Since  {y^}  i*  Independent  of  {uij)>*  we 
can  also  assume  {y^}  is  independent  of  {u^>.  For  the  sake  of  simplicity 
of  relation,  we  still  use  (u^}  Instead  of  {u^}.  This  is  to  say,  we  4$$ume 


that 


u  w 


11  W1 


w  ,  ...  VT„ 
pi  pp 


V  u  e  n 


(Note  that,  for  different  n,  all  the  U's  do  not  still  have  the  relations 
determined  by  (8)  and  (10).) 

Lennna2.  Let  K  >  k  be  two  positive  integers.  Suppose  that 

fR(z)  -  a^n)zK  +•*  *+aon^  "*■  "  ®kzk  +,,,+ao»  where  0,  afc  4  0,  n  =  |. 

Let  denote  the  roots  of  f.  Then  we  can  suitably  arrange  the  K  rdocs 

of  fR  as  z . . .  ,z<n)  ,  z£"j ....  .z^  such  that 


U<n)l  ♦ 


for  i  <  k 


for  i  >  k 


as  n  -*■  ®. 


Proof.  If  K  >  k,  then  a£n^  -*•  0  and  a£n^  a^  +  0,  hence  |a£nVa£n^|  -*■  <*>.  By 
Weida  Theorem  there  must  be  a  sequence  of  roots,  say  z£n^  which  tends  to 
infinity.  Thus,  there  must  be  a  K  -  l  degree  polynomial  p£"^(z),  for  each  ft, 

such  that  fn(z)  ■  (1 - 7n)^PK-l^ *  Noting  fn  *  f*  we  get  PK-1^ 

ZK  ~  ~  (n) 

as  n  -*•  ».  By  induction,  we  can  find  that  there  are  K  -  k  roots  zj  ' , 

j  -  k  +  1,...,K,  such  that  | z |  -*>  »,  also  there  is  a  k-degree  polynomial 

p£n^(z)  -*•  f(z),  and  all  the  k  roots  of  P^n\z)  are  the  remaining  roots  of  -p^ . 

We  claim  that  there  must  be  a  root  of  P^n^(z),  say  z^n\  such  that 

z|n^  z i  as  n  -*•  *.  Otherwise,  there  must  be  a  positive  number  such  that 

£<klz‘n)'  ZM  i  ««  >  « 


-  "0 


holds  for  infinitely  many  n.  Let  P^n^(z)  ■  b^n^  H  (z  -  z^n^),  where  b5n^  ifi 

*  K  i-1 

the  coefficient  of  first  term  of  P^n^ (z) .  On  one  hand,  P^n^(Zj)  +  0  ai  ne«, 
since  P^(z,)  f(z.)  ■  0,  as  n  +  •.  On  the  other  hand. 
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/0. 


|p^U)|  -  |b‘n)l  ;  |.,-«<n)|  »  1^1 

1*  1 

for  infinitely  many  n. 


Note  |b^  |  ■*  ja^l  >  0  and  we  get  a  contradiction.  Thus  our  assertion  is 
proved.  By  decomposition  theorem  of  polynomial  there  is  a  k  -  1  degree 


polynomial  P^J(z)  such  that 


P<n)Cz)  -  (z  -  z<">)P^>(z) 


Pk-P2^  ^  Pk-1^  as  n 


where  ^(z)  is  a  k  -  1  degree  polynomial  such  that 


f(z)  -  (z  -  z  j)Pj^_  x  (z)  . 


By  induction,  ve  prove  the  Letmna. 

Split  W  into  blocks  according  to  the  fashion  of  split  of  C.  Write  +h« 
blocks  of  W  as  .  Since  U  -►  W,  ¥  u  e  by  lemma  2  we  know  that  the 
positive  roots  ^  (excluding  the  p  -  2.^  multiplicity  of  root  zero)  satisfy 

^  +  0(1),  i  -  ah_1  +  i,...,a^,  h  •  1,  2,...,v  +  1  <12 

vdiere  X  ^  -  0,  a^.  -  l Set  A  “  min  (X,  -  X.+j)  >0.  In  what  follows, 

l<h<v 

we  fix  w  c  0.  Substituting  $  •  X^  +  vq  into  the  lefthand  side  of  (9), 

1/2 

dividing  by  v  the  first  rows  and  the  first  y^  columns  of  the  deterisin- 
antal  equation  in  (9)  ,  and  letting  N  -*•  «,  we  find  the  lefthand  side  of  <0 
tends  to 


C11  “  X1W11  “  nIy. 


(*2  ~ 


•(X  -  X,)IU 

v  1  Wv 


-  X.I 

i  p-r 


which  is  a  y, -degree  polynomial  in  n  and  whose  roots  are  the  same  as  t|Mt 
of  the  following  equation. 


'.V 


By  (12)  we  know  when  N  large  enough 

1*1  “  *lJ  *  1  *  *h-l  +  1»***»V  h  ■  1,  2,...,  v  +  1. 

Write  ■  (4t  -  A^/v  and  denote  by  rij  .>  n2>...>  n  >0  the  roots  of  C 13) 


Then  we  have 


|n±|  <  1  -  1,  2 . Uj  -  aj 


InJ  <  -  2A/3v  1  -  aj  +  1 . 11 


By  Lemma  2  we  know  that 


nl  ■*  V  1  *  1 »  2 . *  a! 


1  ■  at  +  1 . tj. 

Consider  ghe  Uj  *  Uj  matrix  -  AjW^.  **»•  diagonal  elements  of  efllg 
matrix  are 

*Vu  -  'Vtt  -  *7  +  4J^  N(0 , 1) , 

and  the  off-diagonal  elements  are 

^Vyij  +  yiJ)  ’  X1U1J  *  ^X1  +  2X1 

Hence,  the  distribution  of  roots  of  (13)  Is  the  same  as  that  of 

+  2Aj  wH  -  nl|  -  0.  0- 

Set  -  n±lJl xf  +  4Aj  — ►  4Aj,  1  *  1,  2, . . . ,Uj ,  as  N  * • 

Following  the  same  lines  as  the  proof  In  [2],  the  distribution  .of  g  ,  1  -  I, 
tends  to  that  as  stated  in  Hsu's  Theorem. 

Similarly,  we  can  prove  that  if  we  write 

"i  *  (*i  -  V/v-  1  *  Vi  + 1 . v h  ■ ». 2 . . 

then 

«  N  • 

where  1  •  a^  +  1 . a.  ,  ere  the  roots  of  the  determinants!  equation 


12 


where  A  is  the  lower-right  (p  -  r)  x  (p  -  r)  submatrix  of  A.  (15)  is  equi¬ 
valent  to 

det(A  -  tU  E'  -  Cl)  «  0.  (16) 

A1  1  1  v 

On  recalling  the  elements  of  A  and  E^,  (16)  is  in  fact  the  following  equation 

det  le^  -  Ui:J||-  0  (17) 

where  6^  ■  1,  6^  ■  0  (for  i  j*  j)  and 

nl 

*«  ’s-Lr^e' 

Urite  the  roots  of  (17)  as  *•••»?£  »  0,...0. 

is  -  r,  hence  (17)  has  p  -  ij  multiplicity  of  root  zero.)  Set 
-  2 

Cf  ■  ♦  /v  ,  i«r+l . ij.  By  Leona  2  we  know 

4  i  *  r  + 

According  to  the  proof  of  Hsu,  we  know  the  distribution  of  i  *  f  +  1».. 
tends  to  that  aa  stated  in  Hsu's  Theorem. 

The  Independence  among  each  group  of  limits  of  the  corresponding  grasps 


(Note  the  rank  of 


of  roots  is  obvious . 


The  proof  of  Hsu's  Theorem  is  thus  completed 


i 


^W3KS3B3PaB^aaaBaggngrarT?»  n»gqf!’ 
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